CDS

Accession Number TCMCG020C01084
gbkey CDS
Protein Id RAL48764.1
Location complement(join(3944239..3944318,3944619..3944791,3944891..3944971,3945447..3945512,3945664..3945736,3945914..3945953,3947183..3947255,3947436..3947564,3947694..3948454,3948606..3948699,3948833..3948870,3948957..3949083,3949207..3949469))
Organism Cuscuta australis
locus_tag DM860_001084
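
The Location field above can be checked against the protein length reported in this record. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual GenBank feature-location convention) and that the final codon is a stop codon. The helper names `parse_segments` and `spliced_length` are hypothetical and are not part of any database tooling.

```python
import re

# Location string copied from this record's CDS entry.
LOCATION = (
    "complement(join(3944239..3944318,3944619..3944791,3944891..3944971,"
    "3945447..3945512,3945664..3945736,3945914..3945953,3947183..3947255,"
    "3947436..3947564,3947694..3948454,3948606..3948699,3948833..3948870,"
    "3948957..3949083,3949207..3949469))"
)


def parse_segments(location: str) -> list[tuple[int, int]]:
    """Return (start, end) pairs for every 'start..end' span in the location."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(location: str) -> int:
    """Sum segment lengths, treating coordinates as 1-based and inclusive (assumption)."""
    return sum(end - start + 1 for start, end in parse_segments(location))


segments = parse_segments(LOCATION)
nt = spliced_length(LOCATION)
codons, remainder = divmod(nt, 3)

# Assumption: the last codon is a stop codon, so the protein has one residue fewer.
protein_length = codons - 1

print(f"segments={len(segments)} nucleotides={nt} protein_length={protein_length}")
```

Under these assumptions the 13 segments sum to 1998 nucleotides, that is 666 codons including the stop, which agrees with the 665 aa length listed in the Protein section.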

Protein

Length 665 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA394036, BioSample:SAMN07347267
db_source NQVE01000097.1
Definition hypothetical protein DM860_001084 [Cuscuta australis]
Locus_tag DM860_001084

EGGNOG-MAPPER Annotation

COG_category L
Description DNA binding domain with preference for A/T rich regions
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03021
KEGG_ko ko:K15200
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGAGAGGGAGGAAAGCCAAGGGTAGCGAACAAGCTGAGCAGCGGCACCAGTCTGAAGCCGTGCTGCTTCTTCCGGAGACTCGGGAGGTGGAAGAACCCGGTGAGGCTCATCCTGGATTTGAGGTATCGTTTTTTGATTATTCAGTTGAAAATCACTTTAGAGCTATTGATACTGCCCGGAAACTATGCGGGGAGCCGGATATTGATGATTCTATTGATCAAGAGGAGCTTCAACGATTTGGTTCTTCCATCACATTCCTTTCGGAATGGAGATATTTAAAATACAAATCAAGAAAAATAAGGTTTGCTTCTGAAAGTGAGAATGGTAATGGGAAAGATGTCAAATGTGAAATTATCTTGCCTCAATTTTCTGCCACAACTGTTCCCAAGGGGACCTCTCAGGAGAAAGTATCTTCTCCACAATCCTGCAATGACCTTGTACTCTATGTTGGAGGTTCTGTTTGGGGCATAGACTGGTGTCCCAGAGCATGTAAGGAATCTGAGTTTCTCTTCCAAAGTGAGTTTGTGGCCATTGCTGCTCATCCGCCTCAATCTTCATATCATAAGATTGGTGCCCCTCTTACTGGCAGGGGTTTCATTCAGATATGGTGTTTGTTGAATCACAGAGTAAAAGATGAGTCGTCCCAAGATGATAAAAAGTTGCGAAAAAAGTCAAGTAAAGGTGAGATAGTTAAGATCAAATCACCTGATCCAAAAAAACCCAGAGGAAGACCCAGGAAGAAACCTTTAAATGTGTCATCAGATGATAAACATGGTGATGAAAATGTGCAACAACCACTTGCAATTGAATATCCTGAAGAATCATCCCCACTTCCCACCACAGGCGACATGGCTTCTGAAAACATCAACAAATCACGAGAAGACTCTAGAAGGAAGCAGGAGGTAACTGAACAGCTACCGCTGACTGCTAAAACTTCTTCAAAACGCAGAAAATTGAATAACAATTCTAGAACAAGCAGCCAGACTTGTGGTTCTGCTTTACCCTTTTTATCATGGGATACAAATGAAAAGTCTTCTTCCATTATTGGTTGTCAAACCTCGCAATGTTGTGCTCTCATGTCTATTGAATCAAGTGGTAATGATACAGCTCTCATGCAAACGATTCCCAATGGTCTTGCTTTACCAAGAATGGTACTGTGTTTGGCTCACAATGGAAAAGTAGCATGGGACATTAAGTGGCGATCATGCCATCTTTCTTGCTCCGAGTCTAGACTGAGAATGGGTTATCTTGCTGTTTTGCTGGGAAGTGGAGCTCTAGAAGTGTGGGAGGTCCCTTTTCCTCGCATAATAAAACGGATTTATTCATCAAACATGGAGGGTACCGATCCTCGATTTTTGAAGTTGGAACCAGTGTTTAGATGTTCTATGCTAAAGTGTGGTGATAGGCAAAGTATTCCTTTAACAGTGGAGTGGTCAATGTCATCCTCACGTGATATGATTCTAGCTGGATGTCATGATGGAGTGGTTGCCTTGTGGGTGTTTTCTACTACAAATTCTTCTAAAGACACAAGGCCTTTGCTTTGCTTCAGTGCAGATACAGTGGCCATAAGGTCACTTGCTTGGGCACCATTTGAAAGTGGTACCGAGAGTGATAATGTGGTCATCACTGCTAGTCATAAGGGCTTAAAGTTTTGGGACCTACGTGACCCATTCCATCATTTGCGAGAATTCAATCCTGGACAAGGGGTGGCTATATATAGCCTGGATTGGCTGCCATATCCAAGGTGCATTCTTGTATCGTGTGATGACGGATCCATACGGATTCAGAGTTTGGTAAAGGCTTCCAATGACTTCCCTGTCACTGGAAAGCCGATCCCCATATCCAAACAACAAGGATTTCACACCTATGAGCTGTCATCCTTTGCAATATGGAGTCTGCAAACTTCACGGCTTACAGGTGTGGCCGCATATTGCAGTGCTGATGGTACCACTGCCTATTTCCAGGTTTATTGCTCATATTCATATTATTTAAATTAA
Protein:  
MRGRKAKGSEQAEQRHQSEAVLLLPETREVEEPGEAHPGFEVSFFDYSVENHFRAIDTARKLCGEPDIDDSIDQEELQRFGSSITFLSEWRYLKYKSRKIRFASESENGNGKDVKCEIILPQFSATTVPKGTSQEKVSSPQSCNDLVLYVGGSVWGIDWCPRACKESEFLFQSEFVAIAAHPPQSSYHKIGAPLTGRGFIQIWCLLNHRVKDESSQDDKKLRKKSSKGEIVKIKSPDPKKPRGRPRKKPLNVSSDDKHGDENVQQPLAIEYPEESSPLPTTGDMASENINKSREDSRRKQEVTEQLPLTAKTSSKRRKLNNNSRTSSQTCGSALPFLSWDTNEKSSSIIGCQTSQCCALMSIESSGNDTALMQTIPNGLALPRMVLCLAHNGKVAWDIKWRSCHLSCSESRLRMGYLAVLLGSGALEVWEVPFPRIIKRIYSSNMEGTDPRFLKLEPVFRCSMLKCGDRQSIPLTVEWSMSSSRDMILAGCHDGVVALWVFSTTNSSKDTRPLLCFSADTVAIRSLAWAPFESGTESDNVVITASHKGLKFWDLRDPFHHLREFNPGQGVAIYSLDWLPYPRCILVSCDDGSIRIQSLVKASNDFPVTGKPIPISKQQGFHTYELSSFAIWSLQTSRLTGVAAYCSADGTTAYFQVYCSYSYYLN